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^ Abstract 

a 

^ In this paper we provide some sharp asymptotic results for a stochastic model of 

species survival recently proposed by Guiol, Marchado, and Schinazi. 

I — I 1 Introduction and statement of results 

Ph Recently, Guiol, Marchado, and Schinazi [7] proposed a new mathematical framework 

for modeling species survival which is closely related to the discrete Bak-Sneppen evo- 
d lution model. In the original Bak-Sneppen model [3J a finite number of species are 

^ arranged in a circle, each species being characterized by its location and a parameter 

' representing the fitness of the species and taking values between zero and one. The 

number of species and their location on the circle are fixed and remain unchanged 
J> throughout the evolution of the system. At discrete times n = 0, 1, ... , the species 

with the lowest fitness and its two immediate neighbors update simultaneously their 
l/-^ fitness values at random. The Bak-Sneppen evolution model is often referred to as 

CN an "ecosystem" because of the local interaction between different species. The distin- 

\C> guishing feature of the model is the emergence of self-organized criticality [H [H [6l |8] 

regardless the simplicity and robustness of the underlying evolution mechanism. The 
Bak-Sneppen model has attracted significant attention over the past few decades, but 
it has also been proven to be difficult for analytical study. See for instance ^ for a 
relatively recent survey of the model. 

The asymptotic behavior of the Bak-Sneppen model, as the number of species gets 
arbitrarily large, was conjectured on the basis of computer simulations in [3]. It appears 
that the distribution of the fitness is asymptotically uniform on an interval (/crit'^) 
for some critical parameter f^j-it > the value of which is close to 2/3 [H E] . 

Guiol, Marchado, and Schinazi [7] were able to prove a similar result for a related 
model with a stochastically growing number of species. Their analysis is based on a 
reduction to the study of a certain random walk, which allows them to build a proof 
using well-known results from the theory of random walks. The main result of [7| is 
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thus based on general properties of Markov chains, and suitable variations of the result 
can in principle be carried out to other similar models. 

In this paper we focus on the model introduced in [7J as is. Our aim is to elucidate 
the underlying mechanism responsible for the phenomenon described in [7] by sharp- 
ening the estimates that lead to the major qualitative statement therein. We proceed 
with a description of the Guiol, Marchado, and Schinazi (GMS) model. 

In contrast to the Bak-Sneppen model, the number of species in the GMS model is 
random and changes in time, and only the species with the lowest fitness is randomly 
replaced. The local interaction between species is not considered in the GMS model, 
and therefore the spatial structure of the population is of no importance. Let p > \ 
be given and denote q = 1 — p. Let Z_|_ denote the set of non-negative integers and 
let X = (Xn : n G Z_|_) be a discrete-time birth and death process with the following 
transition probabilities: from each site, Xn increases by 1 with probability p] from each 
site different than 0, X„ decreases by 1 with probability q = {1 — p); finally, at 0, Xn 
stays put with probability q. Thus X is a nearest-neighbor transient random walk on 
the integer lattice Z_|_ with holding times and reflection barrier at zero. Throughout 
the paper we assume that Xq = with probability one. 

The model describes the evolution of a population of species: newborn species are 
given "equal opportunity" by assigning them a random fitness, and the least fit species 
are the first to die. A jump to the right represents birth of a new species, whereas a 
jump to the left represents death of an existing species. Thus Xn represents the number 
of species alive at time n. When a new species is born, it is assigned a fitness. The 
fitness is a uniform [0, 1] random variable independent on the fitness of all previously 
born species as well as of the path of the process X. When X jumps to the left, the 
species with the lowest fitness is eliminated. We remark that, in a different context, a 
similar model was considered by Liggett and Schinazi in [9]. 

Fix / S (0, 1). We examine the model by considering two coupled random processes, 
L = (Ln '■ n £ Z+) (for lower or left;) and R = {Rn '■ n G Z+) (respectively, for right), 
where L„ denotes the number of species alive at time n whose fitness is less than / 
while Rn denotes the number of the remaining species alive at time n. 

Observe that L„ increases by 1 if Xn does and the newborn species has fitness less 
than /, and Ln decreases by 1 whenever Xn decreases by one and L„ is not zero. The 
value of Ln remains unchanged when either Xn increases by 1 and the newborn species 
has fitness at least / or Xn decreases by 1 and Ln = 0. 

Notice that when it is not at zero, the process L evolves as a nearest-neighbor 
random walk with probability pf of jumping to the right, probability q of going to the 
left, and probability 1 — pf — q of staying put. When at zero, Ln jumps to the right 
with probability pf, and stays put with the complementary probability 1 — pf. Thus 
L is a Markov chain. Since p > q, it is positive recurrent if pf < q, null-recurrent if 
pf = q, and is otherwise transient. In what follows we will denote the critical value 
q/p oi the parameter / by /c. 

Consider the process B = (Bn '■ n G Z-|_), where Bn is the total number of species 
born by time n with fitness at least /. Observe that S is a non-decreasing Markov 
chain (formed by sums of i.i.d. Bernulli variables), which jumps one step up with 
probabilities p{l — f) = p — q and stays put with the complementary probability 2q. It 
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is shown in [7] that when f = fc, have that for any e > 

Bji — Rn 2 

0<hmsup — . <-, a.s., (1) 

n^oo q 

while each species with fitness less than the critical value disappears after a finite 
(random) time. It follows from ([T]) that the distribution of species with fitness higher 
than fc approaches a uniform law. 

We sharpen the above result by proving the following theorem. Recall that fc = q/p. 

Theorem 1. Suppose that f = fc- Then 

n -I 

1. limsup = = 1, a.s. 

n-!>oo V 4gn m In n 

2. — " " =^ 1-^(0, 1)1, where N(0, 1) denotes a mean-zero Gaussian random vari- 
able with variance one, and =^ stands for convergence in distribution. 

3. {Ln : n £N) is a recurrent Markov chain (visiting zero infinitely often) and hence 
each species with fitness less than the critical value becomes eventually extinct with 
probability one. 

The proof of the theorem is included in Section [2} Notice that by the law of large 
numbers i?„ ~ n{l — fc), a.s., as n — >• oo. Moreover, the fitness of any randomly chosen 
species from Bn is distributed uniformly on {fc, 1). Hence the theorem implies that this 
uniform law on {fc, 1) is the limiting distribution for the fitness of a randomly chosen 
species being alive at time n and having fitness larger than fc- 



2 Proof of Theorem [H 

There is no loss of generality assuming that X is obtained recursively from an i.i.d. 
sequence of Bernoulli random variables J = (Jn : n G Z+) with P{Jn = 1) = q, 
P{Jn = 0) = p, as follows: Xq = and for n G Ijj^ we have 

Xn+i = Xn + {1- Jn) " " Sn), where Sn := 1{X„=0}- (2) 

Let G = {Gn ■ n G Z+) be a Markov chain formed by the pairs G„ = J„). Figure 
[l] illustrates the transition mechanism of G. 



2.1 Reduction from — Rn to an occupation time of G 

Let A = (A„ : n G Z_|-) be the process defined through A„ = Bn — Rn- Notice that A- 
increases by 1 if and only if i?„ decreases by 1, and otherwise stays put. That is 

A„+i - A„ = l{x„+i-x„=-i}l{L„=o}- 
Thus by Q, A„+i - A„ = (1 - s„) J„l{2,„=o}, and we have 

n— 1 71—1 n— 1 

An = ^(1 - Si)Jil{i^=o} =^n-^ JiSi, where := ^ Jjl{2„^=o}- 

1=0 i=0 i=0 
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Figure 1: Transition probabilities for G 



Observe that rjn is the occupation time (number of visits) of G at (0, 1) up to time n — 1 
and that Y17=o ^i'^i bounded above by X^^gSj. Since X is transient and Xq = 0, 
the latter series can be represented in the form X^^g ~ Sf=i^«> where g is the 
total number of visits to zero (distributed geometrically, according to P{g = k) = 
/^~^(1 — /c), /c G N, see for instance [H p. 274]) and hi are successive holding times 
of X at zero (i.i.d. sequence of geometric random variables, independent of g, with 
P{hi = k) = q^~^p, A: G N). In particular. 



Since L and consequently G are recurrent (and thus A„ is a non-decreasing sequence 
converging to +oo with probability one), it therefore suffices to show that Theorem [l] 
holds with An = Bn — Rn replaced by rjn in its statement. 

Excursion decomposition for the path of L. We decompose the path of L into 
a sequence of successive excursions from 0, each one lasting until (but not including) 
the first time L returns to from 1. Observe that once the Markov chain L is at zero, 
it will stay put until = and the fitness of the newborn particle is less than fc- 
Therefore, the holding time of L at zero during one excursion is a geometric random 
variable with parameter pfc = q. Let /i^ be the i.i.d. sequence of holding times at 
zero during successive excursions of L from zero. That is, P{hk = n) = p"'~^q, n G N. 
In what follows we will use the notation Geom(a) for the geometric distribution with 
parameter a (for instance, we could write hk ~ Geom(g)). Let ak be the A:-th excursion 
starting time, that is ui = and 
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(X^^iJi) <E{g)-Eihi) 



p{l - fc) 2p - 1 



< oo. 



i=0 



Uk = inf{n > cTfc-i : Xn-i = 1, X„ = 0}, A; = 2, 3, . . . 
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Notice that {fi^ : A: S N) is an i.i.d. sequence and 

Nn N„ + l 

^/^ife <r]n< ^ fJ'k- (3) 

k=l k=l 

We now compute /i := E{^k) using the fact that the sequence of pairs G„ = {Jn,Ln) 
forms a Markov chain, the transition mechanism of which is iUustrated in Fig. [T] The 
value of /U is equal to the expected number of visits by this Markov chain to the state 
(0, 1) during the period of time starting at the state (0, 1) with probability q and at 
(0, 0) with probability p, and lasting until G leaves the set {(0, 0), (0, 1)}. We thus have, 
using first step analysis, 

H = q{l + H)+ p{l - fc)^i + p/c • = g + /i(l - pfc) =q + fip. 

Hence /x = 1. Consequently, using ([s]) and the law of large numbers, we obtain 

rjn ~ Nn as n — >■ oo, a.s. (4) 



Reduction to a simple random walk The duration of the A;-th excursion of L 
from zero is equal to the sum hk + ctfe of the holding time at zero hk distributed as 
Geom(g), and the time that it takes for L to return from 1 to zero. Notice that 

ttfc = CTfe - (o"fc-i + hk- 1). 

For m G N let Vm = YlT=iihk + Ofe) be the total duration of the first m excursions 
of L from 0. Then Nf^ = max{m : Vm < k}. With each excursion we can associate 
a skeleton, which is the path obtained from the excursion by omitting all transitions 
from a state to itself, for all states. If we let denote the length of the skeleton, 
then due to the choice of fc it follows that has the same distribution as the time 
required for the simple symmetric random walk on Z to get back to starting from 
0. Furthermore, the time spent by L at each visit to a site is a geometric random 
variable, Geom(2g) for sites different than and Geom(q) for 0. Therefore, the length 
of a single excursion itself is a sum of one Geom(g) random variable plus a sum of 
Tfc — 1 independent Geom(2g) random variables. If we replace with a Geom{2q), 
then the resulting modified "excursion time" becomes a sum of copies of a Geom(2g) 
random variable. Let denote the total length of the first m excursions modified in 
this way. Let N'f^ denote the number of such excursions occurred by time k, that is 
N'f, = max{m : < k}. Then Vm stochastically dominates Vm- By the law of large 
numbers, 

lim YULZJ^ = E{h,) - Eih[) = 1, (5) 
- • — m Zq 



where h[ is a geometric random variable with parameter 2q. Letting Tm = YlT=i '^k, 
we obtain 



k=l 
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where (/i^ : A; S N) is an i.i.d. sequence of random variables, each one distributed as 
Geom{2q). Thus, by the law of large numbers, 

Tm 

V' ~ — as m — >■ oo, a.s. (6) 

Notice that is distributed the same as the total length of the first m excursions 
from zero of a simple symmetric random walk. 

2.2 Completion of the proof: CLT and LIL for r]n 

LIL for rjn We need the following result. Although the claim is a "folk fact", we 
give a short proof for the sake of completeness. 

Lemma 1. 

Tm 

liminf — — = 1 a.s. 

m-!>oo m^/ (2 In mm) 

Proof of Lemma^ Let S = {Sn '■ n E Z_|_) denote the simple symmetric random walk 
on Z. Let 70 = and define inductively 7m+i = inf{A: > 7^ : 5*^ = m + 1}. Let 
Ym = 7m+i —7m with the usual convention that the infimum over an empty set is +00. 
Let (f>{x) = V 2x In In x for x > 0. By the law of iterated logarithm for S, 

1- ,. S-y n 

lim sup = iim sup — — r = lim sup 



(t>{n) n^oo (t>{ln) n^oo Hln) 

Since (p^^{x) ~ x^/(21nlnx) as x — )• 00, we obtain 

liminf = 1. 

n->oo n^/(21nlnn) 

Observe that 7^ = X]"=o^^t- The distribution of ti, the time to return to starting 
from for S, is equal to the distribution of t'^, the time to return to starting from 
for the reflected random walk Since t'^ is equal in distribution to 1 + Yi, 

, . . ^ T-ui , 7n ~\~ n 

limmi — ^TT^ — ; — ; ^ = iim luf 



m-s>oo m^/(21nlnm) n^oo n2/(21nlnn) ' 
completing the proof of the lemma. □ 
Using the lemma along with ^ and Q, we obtain 

m-^oo 771^/(2 In mm) m-s>oo m^/ (2 In mm) 
Consequently, since is the inverse of sequence of V, we obtain 

lim sup = = 1, a.s. 

fc_^oo V In In k 

Combining this with Q completes the proof of the law of iterated logarithm for rjn- 
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CLT for rjn We now turn to the proof of the central limit theorem. It is well known 
(see for instance [5, p. 394]) that 



lim ii;(e-^^'"/™ ) = E(e-^^'), 9 > 0. 
Therefore, it follows from ([s]) and ([g]) that 

lim ^(e-^^-/™') = lim ^(e-^^-/'"") = E(e 



The function — J- e ^^^^e^ > 0, is the Laplace transform of a positive stable law with 
index 1/2 whose density function is given by (see for instance O p. 395]) 



ce '^'Z^" 



</'c(m) = l{„>o}^==-- 



We intend to use this formula with c = Set Vq = and observe that for all A; G N, 



P(iVfe < n) = Po(n«J > 
Fix s > and let u = Vks. Then 

PiN, < VSs) = Pd-^^, > t) = P(^|^ > 4,(1 + o(l))) ^.(.)<iu. 
Differentiating the right-hand side we obtain 

rs 2e-«V(2c-2) 



lim P(iVfe < VA;s) = / du. 

k^'^ Jo V27rc~2 



Therefore N^/Vk converges weakly to the absolute value of a centered normal random 
variable with variance equal to = 2q. Combining this with Q completes the proof 
of the central limit theorem for □ 
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